Adaptive Probabilities in Stochastic Optimization Algorithms
نویسنده
چکیده
Stochastic optimization methods have been extensively studied in recent years. In some classification scenarios such as text document categorization, unbiased methods such as uniform sampling have negative effects on the convergence rate, because of the effects of the potential outlier data points on the estimator. Consequently, it would take more iterations to converge to the optimal value for uniform sampling than non-uniform sampling, which normally allocates larger probability to samples with larger norm. Based on the intuition of non-uniform sampling, we investigate adaptive sampling in which we change the probability per iteration or per epoch during the execution of the algorithm. In this thesis, we start by introducing non-uniform sampling with Stochastic Gradient Descent (SGD), and then study the impact of different choices of stepsize on convergence rate as well as derive theorems for the convergence rate. We analyze Stochastic Dual Coordinate Ascent (SDCA) and obtain the upper bound on the number of iterations to reach a specific level of optimality. This is followed by investigation of adaptive strategies which change the probabilities according to some historical information. We provide theoretical analysis to prove that they further reduce the number of iterations needed to reach the same level of optimality. The adaptive variants of SGD and SDCA are discussed and compared to the existing counterparts. The results show that both AdaSGD and AdaSDCA outperform the existing models. We also provide two online variants which update the probabilities immediately to perform adaptive sampling without the need of additional passes through the data.
منابع مشابه
Faster Optimization through Adaptive Importance Sampling
The current state of the art stochastic optimization algorithms (SGD, SVRG, SCD, SDCA, etc.) are based on sampling one active datapoint uniformly at random in each iteration. Changing these probabilities to better reflect the importance of each datapoint is a natural and powerful idea. In this thesis we analyze Stochastic Coordinate Descent methods with fixed non-uniform and adaptive sampling. ...
متن کاملMarket Adaptive Control Function Optimization in Continuous Cover Forest Management
Economically optimal management of a continuous cover forest is considered here. Initially, there is a large number of trees of different sizes and the forest may contain several species. We want to optimize the harvest decisions over time, using continuous cover forestry, which is denoted by CCF. We maximize our objective function, the expected present value, with consideration of stochastic p...
متن کاملA Bi-objective Stochastic Optimization Model for Humanitarian Relief Chain by Using Evolutionary Algorithms
Due to the increasing amount of natural disasters such as earthquakes and floods and unnatural disasters such as war and terrorist attacks, Humanitarian Relief Chain (HRC) is taken into consideration of most countries. Besides, this paper aims to contribute humanitarian relief chains under uncertainty. In this paper, we address a humanitarian logistics network design problem including local dis...
متن کاملAdaptive search with stochastic acceptance probabilities for global optimization
We present an extension of continuous domain Simulated Annealing. Our algorithm employs a globally reaching candidate generator, adaptive stochastic acceptance probabilities, and converges in probability to the optimal value. An application to simulation-optimization problems with asymptotically diminishing errors is presented. Numerical results on a noisy protein-folding problem are included.
متن کاملThree Hybrid Metaheuristic Algorithms for Stochastic Flexible Flow Shop Scheduling Problem with Preventive Maintenance and Budget Constraint
Stochastic flexible flow shop scheduling problem (SFFSSP) is one the main focus of researchers due to the complexity arises from inherent uncertainties and also the difficulty of solving such NP-hard problems. Conventionally, in such problems each machine’s job process time may encounter uncertainty due to their relevant random behaviour. In order to examine such problems more realistically, fi...
متن کامل